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Abstract 

We obtain polynomial-time approximation-preserving reductions (up to a factor of 1 + e) from the prize- 
collecting Steiner tree and prize-collecting Steiner forest problems in planar graphs to the corresponding 
problems in graphs of bounded treewidth. We also give an exact algorithm for the prize-collecting Steiner 
tree problem that runs in polynomial time for graphs of bounded treewidth. This, combined with our reduc- 
tions, yields a PTAS for the prize-collecting Steiner tree problem in planar graphs and generalizes the PTAS 
of Borradaile, Klein and Mathieu [11 J for the Steiner tree problem in planar graphs. Our results build upon 
the ideas in ifTTI and the work of Bateni, Hajiaghayi and Marx |8| on a PTAS for the Steiner forest problem 
in planar graphs. Our main technical result is on the properties of primal-dual algorithms for Steiner tree 
and forest problems in general graphs when they are run with scaled up penalties. 

1 Introduction 

The Steiner tree and Steiner forest problems are fundamental and well-studied problems in network design. In 
the Steiner tree problem we have an undirected graph G = {V, E) with costs on the edges given by c: — > 
M+, and a set of terminals 5 C F. The goal is to find a minimum-cost tree in G that connects/contains the 
terminals. In the more general Steiner forest problem we are given pairs of vertices siti, . . . , s^t]^ and the 
goal is to find a minimum-cost forest which connects Sj and for 1 < i < k. Both problems are NP-Hard 
and APX-hard to approximate. These two problems have received considerable attention in the approximation 
algorithms literature. For the Steiner tree problem the very recent algorithm of Byrka et al. 1 12| gives a 1.388 
approximation and this is the best known. For Steiner forest problem the best known approximation is 2 — 1/A: 
|[D; this is obtained via a natural cut-based LP relaxation. When G is a planar graph, Borradaile, Klein and 
Matheiu ifTTI obtained a polynomial time approximation scheme (PTAsjl for the Steiner tree problem. More 
recently, Bateni, Hajiaghayi and Marx |8| obtained a PTAS for the Steiner forest problem. 

In this paper we consider the prize-collecting versions of the above problems, which have also received 
considerable attention. In the prize-collecting version of Steiner tree, we are given a root vertex r and non- 
negative penalties tt : V ^ on the vertices of G. The goal is to find a tree T to minimize the sum of 
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the edge-cost of T and the penalties of the vertices not included in T; formally, the objective is to minimize 
YleeE{T) '^(^) + Ylv^v{T) ^(^)- It is s^sy to see that the Steiner tree problem is the special case with an infinite 
penalty on the terminals and zero penalty on non-terminals. In the prize-collecting Steiner forest problem we 
have a penalty tt{uv) for each pair of vertices u,v G V. The goal is to find a forest F C E to minimize 
YleeF + not connected by F T^iuv). For the prizc-collecting Stcincr tree problem, the current best approx- 
imation ratio is 2 — (5 for some small but fixed 6 > 0, due to Archer et al. iH, and for the prize-collecting Steiner 
forest problem, the best known ratio is 2.54 due to Hajiaghayi and Jain |[20i . 

In this paper we obtain a PTAS for the prize-collecting Steiner tree problem in planar graphs; we obtain 
this result via an approximation preserving reduction from the prize-collecting Steiner tree problem in planar 
graphs to the problem in graphs of bounded treewidth. Similarly for the prize-collecting Steiner forest problem, 
we reduce the problem in planar graphs to the problem in graphs of bounded treewidth. Below is our main 
result. 

Theorem 1.1. For each fixed e > 0, there is a p(l + e)-approximation for the prize-collecting Steiner forest 
problem in planar graphs if for for each fixed integer k, there is a p-approximation for the prize-collecting 
Steiner forest problem in graphs of treewidth at most k. In particular, there is a p{l + e)-approximation for 
the prize-collecting Steiner tree problem in planar graphs if there is a p-approximation for the prize-collecting 
Steiner tree problem in graphs of bounded treewidth. 

We describe an exact algorithm for the prize-collecting Steiner tree problem in graphs of bounded treewidth: 

Theorem 1.2. For any fixed integer k, there is a polynomial-time algorithm that exactly solves the prize- 
collecting Steiner tree problem in graphs of treewidth at most k. 

Combining the reduction in Theorem 1 1.1 1 with the above theorem, we obtain the following. 
Corollary 1.3. There is a PTAS for the prize-collecting Steiner tree problem in planar graphs. 

A natural idea would be to show that there is a PTAS for the prize-collecting Steiner forest problem in 
graphs of bounded treewidth. (We do not expect an exact algorithm since the problem is NP-Hard even for this 
case [8].) This would yield a PTAS for the problem in planar graphs. However, recent work in fT\ shows that 
prize-collecting Steiner forest is APX-Hard even in series-parallel graphs, which are planar and have treewidth 
2. 

Remark 1.4. Results analogous to Theorems \l.l\ and \L2\ also apply to the prize-collecting Traveling Salesper- 
son Problem; details are deferred to a later version of the paper 

Remark 1.5. The results for planar graphs extend to graphs of bounded genus. This follows previous ideas; 
details are deferred to a later version of the paper. 

We are motivated to consider the prize-collecting problems for several reasons. First, they generalize the 
Steiner tree and Steiner forest problems. Second, the prize-collecting Steiner tree problem has played a crucial 
role in algorithms for fc-MST |17,i4J, fc-StroU 1, 13.1 and the Orienteering IHIIKM] problems. These problems 
are more difficult than the Steiner tree problem because they also involve choosing the subset of terminals 
to connect. In particular, obtaining a PTAS for /c-MST, /c-StroU or Orienteering in planar graphs appears 
to be challenging with current techniques. The prize-collecting version of Steiner tree can be viewed as an 
intermediate problem that still captures some of these difficulties in that the set of terminals to connect is not 
determined a priori. Techniques and ideas developed for the prize-collecting problems are likely to play a role 
in making progress on problems such as fc-MST and Orienteering. We note that several of these problems have 
PTASes in low-dimensional Euclidean spaces but those rely on space partitioning schemes such as those of 
Arora |3| and Mitchell |23]; planar graph PTASes for network design have been more difficult to obtain since 
there is no comparable generic scheme. Some of the technical challenges will be outlined below where we 
describe our techniques at a high-level. 
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1.1 Overview of Techniques 



Through this paper, we use OPT to denote the cost of an optimal solution to the given problem instance. Also, 
for any solution F for the prize-collecting Steiner forest (or prize-collecting Steiner tree), we use LENGTH (F) 
to denote J^e^F c(e)> Penalty(F) to denote J2uv not connected by F T^{uv), and Cost(F) to denote Length(F)+ 
Penalty(F). For the prize-collecting Steiner tree problem, we refer to all vertices with non-zero penalty as 
terminals; similarly, for prize-collecting Steiner forest, we refer to all pairs of vertices with non-zero penalty as 
terminal pairs. Finally, we assume that e < 1 and (w.l.o.g.) that all terminals in the input graph G have degree 
1 ; if this is not the case for some terminal v, simply connect it to a new vertex v' using an edge of cost 0, and 
use v' as a terminal in the place of v. 

Planar graph approximation schemes for Steiner tree flT| and forest fSl build on many ideas starting with 
the framework of Baker |5| for PTASes for planar graphs and utilizing several subsequent technical tools. 
Before describing the new technical contributions needed to handle prize-collecting problems, we first give a 
very high-level overview of the approach common to our algorithms, and the previous approximation schemes 

of in][8il 

1 . Given a planar graph G, construct a spanner, a subgraph H Q G such that there exists a solution in H of 
cost (1 + e)OPT and the total length of edges in H is /(e) OPT, for some function / depending purely 
on £. 

2. Partition the edges of H into /(e)/e sets Ei,E2, ■ ■ ■ i^'/(e)/e), such that contracting any set results in a 
graph of treewidth 0(/(e)/e) ltT6]l . 

3. Pick the set Ei of minimum total edge length; this cost must be no more than eOPT. Contract the edges 
of this set Ei, yielding a bounded treewidth graph that contains a solution of cost (1 + e)OPT. Solve the 
problem on this graph of bounded treewidth. This may not correspond to a solution in the original graph 
G; add (uncontract) edges of Ei as necessary to obtain a feasible solution in G. The total cost of Ei is at 
most eOPT, and hence the cost of the solution is at most (1 + 2e)0PT. 

We note that the most difficult step is the first, finding a spanner H. Once this is accomplished, the rest 
is somewhat standard; applying a theorem of [16 1 allows one to decompose the edge set of H into pieces, 
and a simple averaging argument implies that one of these pieces has low cost; contracting this piece yields 
a graph of bounded treewidth. Depending on the problem being considered, it may be possible to solve the 
bounded treewidth instance exactly; if not, an approximation algorithm is used. (In [11], steps 2 and 3 are 
modified slightly to obtain a more efficient algorithm; we omit details here.) For prize-collecting Steiner tree, 
it is fairly straightforward to solve the problem exactly in graphs of fixed treewidth. As mentioned before, 
prize-collecting Steiner forest is NP-Hard even in graphs of fixed treewidth; obtaining a small constant-factor 
approximation would be of interest as, from Theorem ll.il this would immediately yield an algorithm for planar 
graphs achieving a similar approximation ratio. Thus, we focus below on Step 1, the construction of the 
spanner H; we describe parts of the spanner constructions of lITTl lSll. and the modifications necessary for the 
prize-collecting variants. 

Prize-collecting Steiner Tree: We first focus on the easier case of Steiner trees and the scheme in ifTTTl . The 

basic idea to find a spanner for the Steiner tree instance is as follows. (We assume we are given an embedding 
of the input planar graph G.) The algorithm starts by computing a 2-approximate (any constant factor would 
do) Steiner tree T in G. One can use an Euler tour of T and splice open along the tour to obtain another plane 
graph G' in which the Euler tour of T is its outer face; note that the total cost of edges on this face is at most 
40PT. Now, all the terminals are on the outer face of G'; this fact is crucial. The algorithm in ifTTI then builds 

^AU the algorithms modify this approach in distinct ways; see the subsequent discussion. 
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on the work of Klein ll22l to obtain the desired spanner H: It begins with the outer face of G' (containing all 
the terminals), and adds edges of G of total cost proportional to the length of this outer face, while guaranteeing 
the existence of a near-optimal solution only using these added edges. We provide details of the construction in 
Appendix lAl 

The main difficulty in extending the above PTAS to the prize-collecting Steiner tree problem is the follow- 
ing. We again wish to find a spanner subgraph of G by starting with an 0(l)-approximate tree T and making 
it the outer face. Unlike the Steiner tree case we run into a difficulty. The approximate tree T is not guaranteed 
to contain all the vertices that are connected to the root in an optimal solution! In fact, if we knew the vertices 
that need to be connected to the root in a near-optimal solution then we can simply reduce the problem to the 
Steiner tree problem. We overcome this difficulty by proving the following. 

Theorem 1.6. There is a polynomial time algorithm that, given a prize-collecting Steiner tree instance in 
a graph G, outputs a tree T of cost 0(l/e)OPT such that there is a {1 + e)-approximate solution T' that 
connects to the root a subset of the vertices in T. 

The algorithm to achieve the above is in fact the Goemans-Williamson primal-dual algorithm for the prize- 
collecting Steiner tree problem but with modified potentials tt'{v) = |7r(v) for each ?; € y. By exploiting 
properties of the primal-dual algorithm we can prove the above theorem. We then proceed as in the Steiner tree 
case, making this tree T the outer face, and adding edges to form a spanner. We note that we cannot use an 
(approximation) algorithm for the prize-collecting Steiner tree problem as a black box in proving the above — 
it is important to rely on the properties of the primal-dual algorithm as we change the potentials. 

Prize-collecting Steiner Forest: The PTAS for Steiner forest, which is quite recent HI, requires two new 
ideas. First, as with the Steiner tree PTAS one starts with an 0(l)-approximate Steiner forest. However, this 
forest has potentially many components and one cannot apply the spanner construction idea of fTTl in making 
a single tree the outer face. At the same time one cannot directly argue that one can treat each tree in the forest 
separately; optimal (or near-optimal) solutions may connect vertices in different components of the forest. In 
iSll there is an additional key step in which the trees are grown via a primal-dual type argument and some of 
them are merged — after this step, the remaining trees are "far apart" and hence can be treated independently 
via the spanner approach of [11]. (More precisely, one obtains a collection of subgraphs Hi,H2, ■ ■ ■ such that 
J2i LENGTH(//j) < /(e)OPT. Further, if OPTj denotes the cost of an optimal solution for the terminal pairs 
in H„ then ^ . OPT^ < (1 + e)OPT.) 

A second difficulty in obtaining a PTAS is in step 3, solving the problem on bounded treewidth graphs. As 
mentioned previously, a PTAS is developed in 1 8] for Steiner forest in bounded treewidth graphs. 

As for the prize-collecting Steiner tree problem, the difficulty in the prize-collecting Steiner forest problem 
is that we do not know which terminal pairs to connect. We deal with this using an approach similar to that for 
the prize-collecting Steiner tree problem. However, the 3-approximate primal-dual algorithm for the Steiner for- 
est problem, due to Hajiaghayi and Jain f^O], is quite complex; each step requires 0(n) max-flow computations. 
It is unclear whether one can prove a theorem similar to Theorem 1 1.61 via the algorithm in [20|. We develop a 
simpler primal-dual algorithm and analysis for the Steiner forest problem that gives a 4-approximation. We use 
the structure and analysis of our algorithm to prove the following theorem. 

Theorem 1.7. There is a polynomial time algorithm that, given a prize-collecting Steiner forest instance in 
a graph G, outputs a forest F of cost 0(l/e)OPT such that there is a {1 + e)-approximate solution F' that 
connects a subset of the pairs connected by F. 

Other related work: There is a substantial amount of literature on various aspects related to the work dis- 
cussed in this paper. We refer the reader to |[T5l for an overview of the progress on obtaining approximation 
schemes for optimization problems on planar graphs; recent papers ll22l ITTl have pushed the techniques 
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further for network design problems. Prize-collecting versions of network design have received considerable 
attention following the work of Goemans and Williamson ifTSl on a primal-dual algorithm which had sev- 
eral applications to other problems such as fc-MST. Following Hajiaghayi and Jain's work on prize-collecting 
Steiner forest problem [20], there have been several other papers on prize-collecting network design problems. 
Sharma, Swamy and Williamson |[25l generalized the approach in [20] to obtain primal-dual constant factor 
approximation algorithms for prize-collecting constrained forest problems (in the framework of Goemans and 
Williamson [T8|) with submodular penalty functions. Gutner |19| gave a very simple and efficient local-ratio 
based 3-approximation for prize collecting Steiner forest problem which also applies to the generalized version 
with more than two terminals in a group; although Gutner's algorithm is quite simple, it does not seem possible 
to prove Theorems 11.61 and 1 1 . 7 1 directly via his algorithm. Constant factor approximation ratios have also been 
obtained for more general problems with higher connectivity requirements ll24ll2T]| . 

Relation to independent work in Bateni, Hajiaghayi and Marx [7| have obtained several results on prize- 
collecting network design problems in planar graphs. Our work was done independently before we were aware 
of their paper; we have since learnt that their results were obtained slightly earlier than ours. We briefly compare 
the results and techniques between their work and ours. They have an analogous theorem to Theorem 1 1.1 1 but 
they also consider a more general problem, namely prize-collecting Steiner forest with submodular penalty 
functions. They obtain PTASes for prize-collecting Steiner tree and related problems such as prize-collecting 
TSP and stroll problems, much as we can obtain via the reduction to bounded treewidth instances. They 
complement their algorithms with an interesting hardness of approximation result which shows that prize- 
collecting Steiner forest is APX-hard in series-parallel graphs which are planar graphs and have treewidth 2. 
Their APX hardness also extends to instances in the Euclidean plane. The main part of proof of the reduction 
to bounded treewidth instances in [7 1 differs from ours in some ways. While we rely on properties of a primal- 
dual algorithm for the underlying problem with scaled up penalties, they use a separate primal-dual clustering 
step on top of the trees returned by an approximation algorithm for the underlying problem; this is inspired by 
earher work of Archer et al. lH and further extended in |i8j. 

Outline: In Section |2j we give a 4-approximate primal-dual algorithm for the prize-collecting Steiner forest 
problem. In Section|3l we prove Theorem ll.il giving the complete reduction from the prize-collecting problems 
in planar graphs to their bounded-treewidth versions. This reduction has three parts: First, in Section 13.11 
we prove Theorems 11.61 and 11.71 showing that we can get 0(l/e)-approximate solutions connecting almost 
all the terminals connected by an optimal solution. Second, we use these theorems to complete the spanner 
constructions; details are provided in Appendix |Al Third, we give the remaining details of the reduction in 
Section 13.21 Finally, in Section HJ we give an exact algorithm for the prize-collecting Steiner tree problem in 
graphs of fixed treewidth. 

2 A Primal-Dual Algorithm for Prize-Collecting Steiner Forest 

Our algorithm is similar to the Goemans-Williamson lITSl primal-dual algorithm for prize-collecting Steiner 
tree. For the prize-collecting Steiner tree, this gives a 2-approximation (with some additional guarantees), but 
this does not appear possible in the Steiner forest variant. In this section, we give a simple 4-approximation for 
the prize-collecting Steiner forest problem, and later exploit properties of this algorithm to prove Theorem 1 1.7 1 

We give primal and dual linear programming formulations for the prize-collecting Steiner forest problem 
below. For each pair (sj, ti) the variable Zj is 1 if we pay the penalty for not connecting the pair, and otherwise; 
the variable Xe denotes whether the edge e is selected for the forest. We abuse notation and say that a set S 
separates (the terminal pair) i if it separates Si from ti. Let Si denote the collection of sets S that separate i. 
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A reader unfamiliar with the primal-dual algorithms of |T, TSl (for the Steiner forest and prize-collecting 
Steiner tree problems respectively) may wish to skip this paragraph and the next, proceeding directly to the 
description of our algorithm. Here, we briefly describe one way in which our algorithm differs from those of 
||Tl[l8j. A natural approach is to initially assign each terminal Si or ti a potential of approximately tTj, and make 
each one an active component. In both the Steiner forest and prize-collecting Steiner tree problems, a natural LP 
formulation has a single dual variable ys for each active component S. One increases the ys variable for each 
active component S uniformly, until either a component "runs out" of potential, or an edge becomes "tight". 
In the former case, the component is deactivated; in the latter, one merges the two components adjacent to the 
edge, and combines their potentials. 

In the prize-collecting Steiner forest problem, however, we have a collection of dual variables g for a 
single active component S. We still wish each active component S to grow at a uniform rate, but it is now not 
clear which pair (s,, ti) separated by S should pay for the growth of S. Below, we show that there is a natural 
way to share the cost of this growth among the pairs separated by S; once this is done, the analysis proceeds as 



Before describing the algorithm, we present some useful notation. The primal-dual algorithm begins with 
a growth phase, followed by a deletion phase. At all times, our algorithm maintains a forest F of edges. The 
connected components of F are labeled either active or inactive; we use C to denote the set of active components 
at any given time. Every component is active at the time it is formed; we use C to denote the set of components 
that were ever formed during the algorithm's execution. (Note that as we only add edges to F during the growth 
phase, C is a laminar family of components.) When a component S £ C is formed, we assign it a potential 
Potential (5) that corresponds (roughly) to the penalty of terminal pairs separated by S; thus, this measures how 
much we are willing to pay in order to connect terminals in S to their partners outside. If we grow S by more 
than its potential without meeting the desired partners, then it is more effective to pay the penalty than connect 
the pairs separated by S; at this point, we will mark S inactive. (Of course, some terminals in S may meet their 
partners while others do not; this is the key difference between Steiner tree and Steiner forest, and we describe 
how to make this intuition more precise below.) 

We form new components by adding edges to F, merging existing components; when two components 
merge, they combine their potentials. We say that S £ C unites i if 5 is the smallest active component in C 
containing both Sj and ti. Terminals are labeled satisfied, alive, or dead; initially, all terminals are alive. If a 
component S uniting i is formed, the terminals Si and ti are marked satisfied if they were both ahve immediately 
before the formation of S. (As satisfied terminals are no longer willing to pay for the growth of a component S 
containing them, we adjust Potential (5) if necessary; the procedure ProcessHistory handles the necessary 
bookkeeping.) Once we form a component 5 G C, we "grow" it by increasing an auxiliary dual variable y{S); 
simultaneously, we decrease Potential(S') to pay for this growth. Once Potential(5) becomes 0, a component 
is labeled inactive. (Recall that all components are active when they are formed.) When a component becomes 
inactive, all of its unsatisfied terminals are marked dead. (Intuitively, it is now more effective to pay the 



inllllllll. 



6 



associated penalties than to try to connect them to their partners.) For any terminal Sj (respectively ti), we use 
History(sj) (respectively History(tj)) to denote the set of components S e C such that S contains Sj (ti) and Sj 
(ti) was alive after S was formed. 

Finally, we note that the auxiliary variables y{S) do not exist in our dual LP formulation; instead, we have 
variables yi^s- We ensure that at the end of the algorithm, y(S) = seSi yi,s- order to determine how 
y(S) is split among the variables yi^s we maintain an associated variable Uncharged(S'), the uncharged growth 
of S. The following proposition is entirely straightforward, as we increase y(S) and Uncharged(S') together: 

Proposition 2.1. For each S E C, as soon as S is marked inactive or S becomes part of a larger component, 
we have Uncharged(S) = y{S). 

The procedure ProcessHistory ensures that Uncharged(5) (and hence y(S)) is split appropriately among 
the dual variables yi^s- We omit a proof of the proposition below: 

Proposition 2.2. When the main While loop o/Primal-Dual Forest terminates, for each pair of terminals 
(si, ti), we have called PROCESSHlSTORYfsj j and PROCESSHlSTORYftj j. 

We initialize the set of active components to be the set of terminals, and set Potential (sj) = Potential (tj) = 
7ri/2 for each 1 < i < h. 



Primal-Dual Forest 

C ^ {{si} :l<i<h}U {{ti}: 1 < i < h} 
For all 1 < i < /i: 

Potential ({.Sj}) = Potential ({tj}) = 7rj/2 
While there is an active component: {{Begin Main Loop)) 

Increase ys by A for each S e C until an edge goes tight, or a component uses all its potential. 

For each S eC: 

Potentials') ^ Potential (5) - A; Uncharged(S') Uncharged(S') + A 
If (edge e connecting , S2 goes tight) 
F ^ FU{e} 

C^{C\{Si,S2})U{SiUS2} 
y{Si U S2) ^ 0; Uncharged(S'i U ^2) ^ 
Potential (Si U ^2) ^ Potential (^i) + Potential (52) 
For each i such that U ^2 unites i: 

If (si is alive), Potentia^^i U ^2) ^ Potentia^Si U S2) - PROCESSHlSTORY(si) 

If (t, is alive), Potential(S'i U S2) ^ Potentials'! U ^2)- PROCESSHlSTORY(tj) 

If (si AND ti are alive), Mark Si, ti as satisfied 

Else, Mark Sj, ti as dead 
If (component S uses all its potential) 

C^C\{S} {{Mark S as inactive)) 

For each alive Sj € 5 : {{Similarly for each alive tj G S)) 



Mark Si as dead 
PROCESSHlSTORY(si) 

End While 



{{End Main Loop)) 
{{Deletion Phase)) 



For each edge e G F: 

Delete e if F — e does not separate any pair of satisfied terminals 
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PROCESSHlSTORY(gt): 

RemainingCharge(sj) ^ 7rj/2 

For each S € History (sj), in increasing order of size: 

If Uncharged (5) < RemainingCharge(sj): {{AH uncharged growth of S can be charged to Si)) 

RemainingCharge(sj) ^ RemainingCharge(si) — Uncharged(5) 

Vi^s ^ Uncharged(S') 

Uncharged (S") ^ 

Else: {{Charge as much as possible to Si)) 

Uncharged(5) ^ Uncharged(S') — RemainingCharge(si) 
Ui^s ^ RemainingCharge(sj) 
RemainingCharge(sj) ^ 
Return 
Return RemainingCharge(sj) 



Lemma 2.3. For each component S & C, y{S) = seSiVhS- For each dead terminal Si, we have 

J2seHi.stonis,) Vi,S = similarly, for each dead t„ J2seHLstoniu) Vi,S = 

Proof Sketch: It is easy to verify these statements by checking that the algorithm maintains the invariant that 
for each component S G C, Potential(S') + Y^g'eC-S'cs Uncharged(5') = X^Aiive iG5 D 

Lemma 2.4. The variables yi^s from the algorithm Primal-Dual FOREST correspond to a feasible dual 
solution for the LP Dual-PCSF. 

Proof: It is easy to verify that all constraints are satisfied: For any edge e, it is added to F once J2s-ee5{S) vi'^) ~ 
Ce, and subsequently there is no active component S such that e G ^(S). As y{S) = sgS- ^^-S^ we sat- 
isfy the associated constraint. Further, for any pair Si,ti, the procedure ProcessHistory guarantees that 

Es-.s^eSA^s yi,s < ^*/2, and similarly Es:s,<^s,ues Vi^s < ^T^/2. □ 

Theorem 2.5. If F denotes the forest returned by the algorithm Primal-Dual FOREST, then X^bgf + 
Si separated by f'^^ — 40PT, where OPT denotes the cost of an optimal prize -collecting Steiner forest. 
Proof: As OPT is upper bounded by the value of any feasible dual solution, it suffices to show 

eEF i separated by F Si: S&Si 

For any i to be separated by the forest F, either Sj or ti (or both) must have been marked dead, and thus 
from Lemma 1231 we have vTj < 2 X^^g^. yi,s- Hence it suffices to prove 

eeF i separated by F S&Si \ Si: S^S^ J \ i SeS^ 

We prove YlieeF Ce < 2 YliS vi^)' which, using the fact that y{S) = ^g^. yi^s> implies the inequahty 
above. One can now use the standard primal-dual proof technique of |[T][T8l. Since an edge e is added to F 
only when it becomes tight, we have Cg = J2s-ee5{S) ^l*^)' hence, the desired inequality is equivalent to: 

5;|J(5)nF|y(S)<2^y(S). 
s s 

To verify this inequality, we check that in every iteration of the while loop, the increase in both the left- 
and right-hand sides satisfies the inequality. Since y{S) increases only for components that are active in a given 
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iteration, and we raise y{S) uniformly by A for each S € C, this is equivalent to checking that in any iteration, 
Active s ^ — ■ "^1' where na denotes the number of components active in this iteration. 
Construct an auxiliary graph H by beginning with G{V, F), and shrinking each currently active and inactive 
component to a single vertex. Discard any isolated vertex of H corresponding to an inactive component. We 
now argue that the average degree in H of the vertices corresponding to active components is at most 2; this 
completes the proof. To bound the average degree of the vertices corresponding to active components, we note 
that the average degree of all vertices is less than 2 (as H is a. forest), and that each vertex corresponding to an 
inactive component has degree at least 2. If the latter were not true, there is an inactive component which is a 
leaf of H, incident to a single edge e. But no inactive component separates a satisfied terminal from its partner, 
and so the edge e would have been deleted from F in the Deletion Phase of Primal-Dual Forest. □ 

Running time: Prima-Dual Forest can be implemented in 0{nh + log n) time where h is the number 
of pairs with strictly positive penalties and n is the number of nodes in the graph. The only additional difficulty 
in the algorithm as compared to the primal-dual algorithmic framework of Goemans and Williamson [ 1 8 1 for 
prize-collecting Steiner tree and related problems is the ProcessHistory step. When two active components 
merge, for each pair Sjt, united by this merge, we have to go over the sets in History(sj) and History(tj). 
The sets in C form a laminar family on V and hence \C\ = 0{n) and thus PROCESSHlSTORY(sj) can be 
implemented in 0{n) time by considering each set in \C\. Similarly, PROCESSHlSTORY(tj). Since a pair is 
united at most once, and each terminal is marked dead at most once, the total work involved in processing the 
histories is 0{nh). Priority queues can be used to keep track of the events corresponding to edges becoming 
tight and components becoming inactive; this is very similar to the implementation in ifTSl and the total work 
involved is 0(n^ log n) time. 



3 The Reduction to Bounded Treewidth: Building Spanners 

Recall that the first step in building a spanner for the Steiner tree and forest problems was to construct a new 
plane graph G' in which (i) all terminals are on the outer face and (ii) the length of the outer face is 0(1) • OPT. 
This was done by splicing open an Euler tour of an 0(1) approximate solution in the original graph G, and 
converting the tour into the outer facejl In the prize-collecting versions, however, we do not know which 
terminals to connect, and it is not possible to find an 0(l)-approximate solution in which all the terminals are 
connected to the root (for Steiner tree) or to their partners (for Steiner forest). 

In Section ISTTl we prove Theorems 11.61 and 11.71 showing that we can find a solution of cost 0(l/e)OPT 
which connects "almost" all the terminals connected by any optimal solution. More precisely, the total penalty 
of terminals connected by an optimal solution but not by our 0(l/e)-approximate solution is at most eOPT. In 
Appendix |A1 we complete the construction of the spanner, using ideas from ifTTl lSl. 

3.1 Scaling Penalties to Capture Important Terminals 

We prove Theorem 11.71 which implies Theorem 1 1.61 as prize-collecting Steiner tree is a special case of prize- 
collecting Steiner forest. Given an instance / of the prize-collecting Steiner forest on a graph G{V, E), with Ce 
denoting the cost of edge e £ E and vTj the penalty for not connecting Sj to ij, we define a new instance /' as 
follows: The graph and edge cost functions are unchanged, but we scale the penalties so that the penalty for not 
connecting Si to ti is vr ■ = 27rj / e. 

Theorem 3.1. Let F* be any optimal solution to an instance I of prize-collecting Steiner forest, and let OPT = 

Ylie&F* ^e. + Y^i separated by F* '^i- ^' the f Ore St OUtput by algorithm PRIMAL-DUAL FOREST on the 

^In the case of Steiner forest, this is done separately for distinct trees in the 0(l)-approximate forest. 
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instance I' with penalties scaled as above. Let X denote the index set of the terminal pairs separated by F' but 
not by F*. Then, J2e€F' < 80PT/e, and J2i€X ^ eOPT. 

Proof: We first note that the cost of an optimal solution to /' is at most 20PT/e; simply use the forest F*, 
which pays 2/s times as much penalty for every separated pair as it did in /. Thus, as Primal-Dual Forest 
is a 4-approximation, we have J2eeF' '^e ^ 80PT/e. 

To prove that the total penalty of pairs in X is small, consider a Steiner forest instance defined on these 
pairs: As F* connects all the terminals in X to their partners, the cost of an optimal Steiner forest for X is at 
most OPT. Suppose, by way of contradiction, that J2iex ^* ^ eOPT, and hence that J2i(^x ^ 20PT. Now 
consider the following dual of a natural LP for the Steiner Forest instance induced by X: 

Dual-Steiner Forest(X) 

max zs 

S separating some isX 

XI ZS < Ce 
S: eG(5(5) 

ZS > 



(Ve) 
NS) 



Let yj 5 be the feasible solution to Dual-PCSF returned by Primal-Dual Forest on instance /'. Now, 
construct a dual solution to the LP Dual-Steiner Forest(X) as follows: For each set S separating some pair Sjtj 
with i ^ X, set zs = X^jgx yi,s ■ As X]s eG<5(S) X^j- se5 yi,s ^ from the feasibility of the solution to Dual- 
PCSF, we conclude that the dual variables zs correspond to a feasible solution of Dual-Steiner Forest(X). 

Thus, we have a feasible solution to Dual-Steiner Forest(X) of total value Z^jgx- seSi yi,s- each 
i £ X was not connected by F', and so we must have marked either Si or ti as dead. Hence, from Lemma 1231 
"l^ses y^'S — Kf^- T^^^^ is> the value of our feasible dual solution is at least X^jgx ^i/^ ^ OPT. By weak 
duality, the length of any Steiner forest for X must be greater than OPT. But F* is a Steiner forest for X of 
total length at most OPT, which is a contradiction. □ 

We can now prove Theorem 11.71 

Proof of Theorem ll.7t Let F* be an optimal solution to a given instance / of Steiner forest, and let OPT = 
J2eGF* + Si separated by F* '^i- Construct a forcst F by running algorithm Primal-Dual Forest on the 
scaled instance /'; from Theorem 13 . 1 1 above, the total length of edges in F is at most 80PT/e. If X denotes the 
terminal pairs separated by F but not by F* , the penalty paid by F is at most Yl,i separated by F* '^i + Yliex ^« — 
OPT + eOPT. Thus, F is a forest of total cost 0(8/e + (1 + e))OPT. 

It remains only to argue that there is a (1 + e) -approximate solution F' that connects a subset of the pairs 
connected by F. Let F*-^ denote a solution formed from F* by paying the penalty for any terminal pair in X; 
clearly, the cost of FZx is the cost of X added to J2iex "^i' which is at most OPT + eOPT. □ 

The first part of the spanner construction for the prize-collecting Steiner tree problem is now complete: As 
guaranteed by Theorem 11.61 find a tree T of cost 0(l/e)OPT such that there exists a (1 + e) -approximate 
solution only connecting terminals in T. Now form an Euler tour of T by duplicating edges, splice along this 
tour, and make the tour the outer face of a new graph G'. Now, we have a graph in which all (relevant) terminals 
are on the outer face, and the total length of the outer face is 0(l/e)OPT. The rest of the spanner construction 
proceeds along the lines of [11]; see Appendix lAl 

For the prize-collecting Steiner forest problem, however, more work is required. The forest F guaran- 
teed by Theorem 11.71 may have many components, which cannot be treated in isolation. As in [8], we use 
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a prize-collecting clustering scheme to merge some components of the forest. The intuition is that after this 
clustering, the remaining components are "far apart", and hence can be treated separately. The prize-collecting 
clustering algorithm is as follows: Contract each tree Tj of F to a single vertex Vi, to obtain a new graph 
G. We let vi,V2, ■ ■ ■ denote the vertices of G corresponding to the contracted trees Ti, T2, . . . of F; note 
that G additionally has the vertices of G not contained in any tree Tj. With each Vi, we associate a potential 
4>y^ = i Length (Tj). Now run the standard prize-collecting primal-dual algorithm as in lITSl . using potentials 
. . Initially, each vertex is an active component, with potential (py^ ; in each step, the algorithm decreases po- 
tentials of all active components uniformly until either a component runs out of potential, or an edge becomes 
"tight". In the former case, the component is marked as inactive. In the latter case, the two components adjacent 
to the edge are merged, and their potentials combined. This is similar to the algorithm Primal-Dual Forest 
in Section |2l but simpler, as we do not have the additional accounting necessary to handle terminal pairs that 
only wish to connect to each other. As in Primal-Dual Forest, the algorithm maintains a collection of dual 
variables y^, . 5 for each Vi and set S C V{G). A complete description of this algorithm is given in |i8J; we omit 
details from this paper. 

The first stage of the clustering algorithm terminates when all components are marked inactive. Let Fi 
denote the forest of tight edges selected by the algorithm after the first stage. In the second stage, we delete 
any edge e from Fi if it is the unique edge incident to an inactive component. Let F2 denote the set of edges 
remaining. 

Lemma 3.2 (HI). The total length of all edges in F2 is at most 2 ^ • ((>y. = |-Length(F), which is 0(^)OPT. 

We also use the following two technical lemmas. A graph H C G is said to exhaust a vertex Vi E V{G) if, 
for all S such that yv,,s > 0, H contains at least one edge of S{S). (Recall that Vi corresponds to the contracted 
tree Ti of F.) 

Lemma 3.3 (Lemma 10 of |8]). Let V be the set of vertices of G exhausted by a graph H. LENGTH > 

Lemma 3.4 (HI). Let H (1 G connect two vertices vi,V2 in distinct components of F2. Then, H exhausts at 
least one of vi,V2- 

Let T^, T^, . . . be the trees comprising the forest F2. We will now argue that these trees are sufficiently "far 
apart", and so we can treat them separately; we formalize this intuition in the rest of this sub-section. Recall 
that from Theorem 11.71 there is a (1 + e) -approximate solution (in the original graph G) that does not connect 
(that is, pays the penalty for) terminal pairs in distinct components of F, the forest returned by Primal-Dual 
Forest. Let F* denote this solution; Cost(F*) < (1 + e)OPT. We construct a set of prize-collecting Steiner 
forest instances, one for each tree of F2. In instance P , we have irj = iVi for each pair (sj, tj) connected 
by T^, and vr^ = for all other pairs. Let OPT-' denote the cost of an optimal prize-collecting Steiner forest to 
instance P; we prove the following theorem: 

Theorem 3.5 (Following H). OPV < (1 + e)CosT(F*). 

Given this theorem, we can separately solve each instance P; it is easy to see that if we obtain a p- 
approximation to each instance, combining them yields a solution of cost at most p(l + e)C0SJ{F*) = 
p{l + 0(e))OPT. But for each instance P , the tree contains all terminal pairs with non-zero penalty, 
and hence we can splice open the tree and convert the corresponding Euler tour into the outer face to obtain a 
graph in which the length of the outer face is bounded, and all relevant terminals are on this outer face. This 
allows us to proceed with the spanner construction as described in Appendix [A] 

Thus, it remains only to prove Theorem 13. 5[ this closely follows the work of lU, with some additional care 
needed because the cost of a forest includes both its length and penalty. 
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Proof of Theorem l33t Recall that F denotes the forest returned by Primal-Dual Forest, and F* is an 
optimal prize-collecting Steiner forest that pays the penalty for terminal pairs in distinct components of F. 
Trees in F were contracted to form vertices of G; we then ran the prize-collecting clustering algorithm to form 
trees , 7^, . . ., making up the forest F2. We wish to show that if OPT-' denotes the cost of an optimal solution 
for the instance P (induced by terminal pairs in tree of F2), then OPT-' < (1 + e)C0ST(F*). 

To prove this theorem, we construct a set V of trees {T^} such that each G P only connects terminal 
pairs in a single component of F2. Further, LENGTH(Tp) < (1 + e)LENGTH(F*), and every pair connected 
by F* is connected by some T^. Such a set V of trees clearly yields solutions to the instances P, proving the 
theorem. 

We now construct the desired set V. We begin by setting V to be the collection of trees in F*, and then 
modify it as follows. Let Vi be any vertex of G exhausted by F*; prune from F* all terminals in the tree Tj of 
F contracted to form Vi, and add the tree Tj to V. When this process terminates, each tree in V only connects 
pairs in a single component of F2. Suppose this were not true; all trees added to V clearly satisfy this condition, 
so it only remains to consider trees originally in F* (from which some terminals may have been pruned). But 
any tree T* of F* connecting two vertices of G in distinct components of F2 must exhaust one of these vertices 
(from Lemma 1X41 ). and hence the corresponding terminals should have been pruned from T*, which yields a 
contradiction. It is also easy to see that any terminal pairs connected by F* are also connected by some tree in 
V. 

To bound the cost of the trees in V, we simply show that the length of the trees added to V is at most 
e • Length (F*). Let Vi denote the vertex of G corresponding to the tree Tj of F, and V the set of ver- 
tices exhausted by F*. The length of the added trees is simply ^^^^y, LENGTH (Tj). From Lemma 1331 
Length(F*) > T.v,eV'^v^ = E^,,GV"LENGTH(^i)/e. Rearranging, we get X^^^^^, LENGTH (T,) < e- 

LENGTH(F*). ^ ^ ^ ^ 

Finishing the Spanner: We can now complete the spanner construction using the following theorem, implicit 
in the work of li lTI : 

Theorem 3.6. Let I be an instance of prize-collecting Steiner forest on a planar graph G. Let F* be an 
optimal solution to L Given a tree T spanning all terminals of I, for any fixed e > 0, there is a polynomial time 
algorithm to find a planar graph H G such that: (i) LENGTH < /(e) • LENGTH (T) /or some function f 
that depends (exponentially) on e and (ii) there is a solution to instance I in the graph H of cost no more than 
(1 + e)C0ST(F*) + e ■ LENGTH(r). 

The similar theorem stated in f\V\ for instances of the Steiner tree problem is slightly less general, though 
their proof technique can be used to show the theorem we state here. For completeness, we provide a proof in 
Appendix [A] 

3.2 Completing the Reduction 

In Section [3T1 we constructed a forest F2 by first running the algorithm Primal-Dual FOREST on a modified 
instance with scaled penalties, and then running the prize-collecting clustering algorithm of HI. We proved two 
useful properties of F2: In Lemma ll!2l we showed that Cost(F2) < (20/e^)OPT, and in Theorem 13.51 we 
argued that we could separately solve a prize-collecting Steiner forest instance induced by each tree of F2 
without increasing the cost significantly. (Formally, we showed that OPT-' < (1 -f- e)OPT.) 

Now, for each tree T-' of F2, construct a spanner for the instance P . (Recall that T-' spans all terminals in 
P .) Using a parameter e' = Theorem l3.6l guarantees a spanner for P such that (i) LENGTH(ff-') = 

f'{e) ■ Length(T^) for some function /' depending only on e, and (ii) OVT{Hj) < (1 + e')OVV + e' ■ 
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Length (T-^ ), where OPT{Hj) denotes the cost of an optimal prize-collecting Steiner forest in Hj. Hence, 

p3 20 

^OPT(Fj) < (l+e')X]^P'^''+'^'X]'-^^'^™('^^') - (l + 2e)0PT + — •— OPT = (l+3e)0PT (1) 

J J J 

where the second inequality follows from Lemma [l!2l and Theorem [331 

Thus, if we can obtain a ^-approximation to each instance P in the graph H^, we obtain a p{l + 3e)- 
approximation to the original prize-collecting Steiner forest instance. We use the following theorem of fT6l. 

Theorem 3.7 (Demaine, Hajiaghayi, Mohar [16]). Let G be any planar graph, and let k be any integer such 
that k > 2. The edges of G can be partitioned into k sets such that contracting any one of the sets results in a 
graph oftreewidth 0{k). Furthermore, this partition can be found in polynomial time. 

Proof of Theorem ll.lt Let be the spanner for instance P as constructed above. Set k = (1/e') • f'i^), 
where f'(e) is the function such that LENGTH (iJ-^) < /'(e)LENGTH(T-'). 

Let El, - ■ ■ , -Efc be the decomposition of the edges of that is guaranteed by Theorem l3.7l Let Ei* be the 
set of edges that has minimum length among the sets Ei, - ■ ■ ,Ek. We have 

, , LENGTH (i?-?) /'(e) • Length (T^) , , 
Length < — ^ — ^ < ^^--^ ^ — = e' ■ Length (T^). 

Let = /Ei*; that is, is the graph obtained from by contracting the edges in Ei*. We assign 
new penalties to terminal pairs of in the natural way: When we contract an edge uv into a single vertex w, 
we replace each terminal pair {u, x) with a new pair {w, x) with the same penalty as (n, x), and we similarly 
replace each terminal pair (f , y) with a new pair {w, y) with the same penalty as (f , y). Let OVT{H^) denote 
the cost of an optimal prize-collecting Steiner forest in Hi; it is obvious that OPT(i?-?) < OPT(iJ'^). 

Since Hi has treewidth at most k, if there is a p-approximation for prize-collecting Steiner forest in graphs 
of fixed treewidth, we can find a p-approximate forest Fi in Hi . We can then map Fi to a forest Fi in Hi 
using the edges in Ei*. By construction, 

C0ST(F-') < COST(FJ) + LENGTH(^i.) < pOVT{Hi) + e' ■ LENGTH(r^). 

Combining such solution Fi for each Hi , we find a forest of total cost pO'PT{Hi)+e' Y,j Length(T-' ). 
Using equation (1), the first term is at most p(l -|- 3e)0PT, and from the choice of e', the second term is at most 
eOPT. □ 

4 Prize- Collecting Steiner Tree in Graphs of Fixed Treewidth 

In this section, for any fixed integer /c > 2, we give a polynomial-time algorithm to optimally solve the prize- 
collecting Steiner tree problem in graphs of treewidth at most k — I. 

A tree decomposition of a graph G is a pair (T, B), where T = (/, F) is a tree, and B = {Bi | z G /} is a 
family of subsets of V{G) such that 

(1) U^eIB^ = V{G) 

(2) for every edge uv G E{G), there exists an i such that {u, v} C Bi 

(3) for every vertex v G V{G), the set of nodes {i G / | G Bi] forms a connected subtree of T 
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We refer to vertices of T as nodes, and to each set of i3 as a bag. The width of a tree decomposition (T, B) is 
the size of the largest bag Bi minus one. As shown in ifTOl . for any fixed k, there is a polynomial time algorithm 
(in fact a linear time algorithm) that constructs a tree decomposition of G of width at most k, or reports that G 
has treewidth greater than k. In the following, we assume that we have a ti^ee decomposition for G of width at 
most k — I, for some fixed k. 

A tree decomposition (T, B) is nice if the tree T is rooted and, for every node f € /, either 

(1) i has no children (i is a leaf node) 

(2) i has exactly two children ii,i2 and Bi-^ = Bi^ = Bi (i is a join node) 

(3) i has a single child j and Bj = Bj U {v} for some vertex v € V{G) (i is an introduce node) 

(4) i has a single child j and Bi = Bj — {v} for some vertex v G V{G) {i is a forget node) 

The following lemma is well-known and it is straightforward to prove. 

Lemma 4.1. There is a linear time algorithm that, given a tree decomposition for G, constructs a nice tree 
decomposition (T, B) of the same width. Moreover, the tree T has 0{\V\) nodes. 

4.1 A Dynamic Program for Prize-Collecting Steiner Tree 

We solve the problem using dynamic programming on a nice tree decomposition (T, B) of width k — 1. For 
each node z E /, let Vi be the set of all vertices appearing in the bags corresponding to the nodes of the subtree 
of T rooted at i. Let Gi be the subgraph of G induced by Vi. We start with an informal overview of the 
algorithm. For the purposes of exposition, we assume that there is only one optimal prize-collecting Steiner 
tree T* (if there are several solutions, we fix one of them). Additionally, we assume without loss of generality 
that the root vertex r is in the bag corresponding to the root node of T. Now fix a node i of T, and consider the 
graph Gi. Clearly, we would like to compute the subgraph F of T* that lies in Gi. In order to do so, we will 
specify some information about this subgraph F. More precisely, we will specify the subgraph H of T* that 
lies in G[Bi], and a partition a of the vertices in H induced by the connected components of F, i.e., each part 
of a consists of all the vertices in H that are in the same connected component of F. (Intuitively, a tells us that 
we need to connect each part using a tree of Gi and all of these trees are guaranteed to be connected to the root 
outside Gi.) It follows from the optimality of T* that F is a minimum cost subgraph of Gi satisfying 

(ci) F[B,] = H 

(c2) the partition of V{H) induced by the connected components of is a refinement of a (if two vertices 
n, V are in the same connected component of H, then u and v are in the same part of a) 

(cs) the partition of V{H) induced by the connected components of F is a 

Let c{i, H, a) be the minimum cost of a subgraph F of Gi satisfying (ci) — (cs). We will compute c{i, H, a) 
for all valid tuples {i, H, a) using dynamic programming. The cost of the optimal prize-collecting Steiner tree 
is equal to minH,a c{r', H, a), where r' is the root node of T, and the minimum is over all pairs {H, a) such 
that H is a. subgraph of G[Br'] containing r, and a has a single part containing the vertices of H. To see why 
this is true, consider a pair {H, a) such that H is a. subgraph of containing r, and a consists of a single 

part containing the vertices of H. If T is a solution for the subproblem (r', H, a) then T is a tree that contains 
r, and hence T is a valid prize-collecting Steiner tree. Conversely, let T be a prize-collecting Steiner tree. Let 
H = T[Br'] and let a be the partition of V{H) induced by T. Since T is a tree containing r, H contains r 
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and a consists of a single part containing the vertices of H. Therefore T is a vahd solution for the subproblem 

{r',H,a). 

Let i be a node of T. Then i is a leaf node, a join node, an introduce node, or a forget node, and we consider 
each of these cases separately. Before we describe the recurrence for c{i,H,a), we introduce some useful 
terminology (borrowed from |8|). 

We can view a partition a as an equivalence relation over the vertices, and we write u =a v if u and v are 
in the same part of a. Let ai and 02 be two partitions of the same vertex set. We say that ai is finer than 02 — 
or equivalently, that ai is a refinement of 02 — ifu =q,^ v implies u =ct2 ^- If finer than olii we say that 
Q!2 is coarser than a\. We use a\ V 02 to denote the finest partition that is coarser than both a\ and a2 (there 
is a unique such partition). 



Node I is a leaf node. Let /? be the partition of Viji) induced by the connected components of H. We have 

{ 



(LENGTHfi?) + PENALTYf^i - V [H)) if a = /3 
c(z, li,a) = \ . (2) 

00 otherwise 



Proof of Equation |2) Since Gi = G\B^, H is the only subgraph satisfying (ci). If a / /3, there is no 
subgraph satisfying (ci) — (03). Otherwise, H is the only subgraph satisfying (ci) — (03) and its cost is 
Length (F) + Penalty (ff). □ 



Node i is a join node. Let ii and 12 be the children of i. We have 

c{i, H, a) = min {c{ii,H, ai) + c{i2,H, 02) - Length(F)) (3) 

ai,a2 

where the minimum is taken over all partitions qi, 02 of V{H) such that q = ai V Q2. 



The intuition behind Equation [3]is the following. Let F, Fi, F2 denote the restrictions of the optimal tree T* to 
Gi, Gi^, Gi^ (respectively). Then F is the union of Fi and F2. Let a, ai, 02 be the partitions of Bi n V{T*) 
induced by the connected components of F, Fi, F2. Since Fi and F2 intersect only at Bir\V{T*), a = qi Va2. 

Formal proofs of correctness for this, and subsequent cases, can be found in Appendix |B] 



Node i is a forget node. Let j be the child of i, and let v be the vertex in Bj — B^. Fix a subgraph H of 
G[Bi], and a partition a of V{H). Let 5 be a subset of the neighbors of v that are in B^. Let E{v, S) denote 
the edges with an endpoint in v and the other in S. Let a{v, S) be the partition of V{H) U {v} obtained from 
a as follows: we merge each part of a that contains a vertex in S into a single part and add v to it; we add all 
remaining parts of a to a{v,S). We have 

c(i, a) = min ^c(j, H, a), min ^ ^ (c(j, H U {v} U E{v, S), a{v, S))^ (4) 

where the second minimum is taken over all sets S C V{H) n r{v) such that S has at most one vertex in each 
part of a. 



The intuition behind Equation |4] is the following. Let F, F' denote the restriction of the optimal tree T* to 
Gi, Gj (respectively). If T* does not contain v, we have F' = F. Therefore we may assume that T* contains 
V, and thus F' consists of F and the edges of E(T*) n E{Gj) that are incident to v. The edges of F' that are 
incident to v have at most one endpoint in each connected component of F. Thus each connected component 
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of F' that does not contain u is a connected component of F, and the connected component of F' containing v 
consists of one or more connected components of F that connect to each other via the edges incident to v. 



Node i is an introduce node. Let j be the child of i, and let v be the vertex in Bi — Bj. Let S be the set 
of all neighbors u of v such that the edge uv is in H. For each partition a' of V{H) — v, we let a'{v, S) be 
the partition of V{H) obtained from a' as follows: we merge each part of a' that contains a vertex in S into a 
single part and add v to it; we add all remaining parts of a' to a'{v, S). We have 

f cij,H, a) + Penalty (v) ifv^V{H) 
c{i,H,a) = s , , (5) 

I miriQ/ (c(j, H — V, a ) + J2uveH LENGTH(nu)) otherwise 

where the minimum is taken over all partitions a' of V{H) — {v} satisfying 

(i) S has at most one vertex in each part of a' 

(a) a'{v, S) = a 
(Note that there exists a partition a' that satisfies the conditions above.) 



The intuition behind Equation [5] is the following. Let F, F' denote the restriction of the optimal tree T* to 
Gi, Gj (respectively). If T* does not contain v, we have F = F' . Therefore we may assume that T* contains 
V, and thus F consists of F' together with the edges of E{T*) n E{G[Bi]) that are incident to v. The edges of 
F that are incident to v have at most one endpoint in each connected component of F' . Thus each connected 
component of F that does not contain v is, & connected component of F' , and the connected component of 
F containing v consists of one or more connected components of F' that connect to each other via the edges 
incident to v. 

Proof of Theorem ll.2t Let be the number of partitions of a A;-element sej^ and let s^. be the number of 
subgraphs of a graph with k vertices. Since each bag has at most k vertices and T has 0(|y |) nodes, there are 
• 6fc ■ Sk) distinct subproblems. Additionally, we can evaluate each subproblem in 0{b\) time once we 
have a solution for each of the subproblems it depends on. (The most expensive evaluation corresponds to a 
join node.) Therefore we can find an optimal prize-collecting Steiner tree in 0{h\ ■ Sk • \ V\) time. □ 
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A Constructing the Spanner: A Proof of Theorem 1X6 



We begin by duplicating the edges of T, and introducing multiple copies of its non-leaf vertices in order to 
transform the Euler tour corresponding to T into a cycle. Let G' by the resulting graph. We then make this 
cycle the outer face A of G'. 

Definition A.l (Definition 6.2, Borradaile et al. ifTTI ). A path P is e-short in G' if for every pair of vertices 
X and y on P, the distance from x to y along P is at most (1 + e) times the distance from x to y in G' (i.e., 

distp{x,y) < {1 + e)distG'{x,y)). 

Strips. Let A[x, y] denote the subpath of the outer face A from x to y. We find a pair of vertices x,y on A 
such that A[x, y] is a minimal subpath of A that is not e-short in G'. Let be a shortest path from x to y in 
G'. The subgraph enclosed by A[x, y] U iV is a strip. We recursively decompose the subgraph of G' enclosed 
by U (A — A[x, y]) into strips, if the graph is nontrivial. 

Lemma A.2 (Lemma 6.3, Borradaile et al. HI in . The total length of all the boundary edges of all the strips is 
at most {e^^ + 1) • Length(A). 

Columns. Consider a strip, with north and south boundaries N and S (N is the shortest path we added when 
we created the strip). We select vertices sq, si, • • • on 5 and paths Co, Ci, . . . inside the strip as follows. The 
vertex sq is the left endpoint common to S and A^, and column Co is the (empty) shortest path from sq to N. 
Now suppose that we have selected vertices so, si, . . . , Si-i and columns Co, Ci, . . . , Cj_i. The vertex Sj is 
the first vertex on S such that the distance from Si_i to Si on S is greater than e times the distance from Sj to 
A^ in the strip, and the column Cj is the shortest path in the strip from Sj to A^. 

Lemma A.3 (Lemma 6.4, Borradaile et al. Ullll ). The sum of the lengths of the columns in a strip is at most 
Length (S) /e, where S is the south boundary of the strip. 
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Supercolumns. Let 



£2 ye 

For each strip, we select a subset of the columns {Co, Ci, . . . } of the strip as follows. Let 

Ci = {Cj \ j = i mod k} 

where < i < A; — 1. Let i* be the index that minimizes LENGTH(Ci). We designate the columns in Ci* as the 
supercolumns of the strip. 

Lemma A.4 (Lemma 6.5, Borradaile et al. [111). The sum of the lengths of the supercolumns in a strip is at 
most 1/k times the sum of the lengths of the columns in the strip. 

Lemma A.5. The sum over all strips of the length of all the supercolumns is at most e ■ Length(A). 
Proof: By Lemma |AT2] and Lemma |AT3l the total length of the columns is at most 

- ( - + 1 ) Length(A) 



e \e 

By Lemma IA!41 the total length of the supercolumns is at most 

Y ■ (total length of columns) < ^ • - f - + 1 ) Length(A) = e ■ Length(A) n 

k k e \e J 

Mortar Graph. The mortar graph MG is a subgraph of the original graph G consisting of the edges of the 
given tree T (that was doubled to form the outer face A of G'), the edges of the shortest paths that define the 
strips, and the edges of the supercolumns. 

Lemma A.6. The length of the mortar graph MG is at most (| + e) • LENGTH (A). 

Proof: The total length of the strips is at most (i + l) LENGTH (A). The total length of the supercolumns is at 
most e ■ Length(A). The length of T is precisely half the length of A, which consisted of two copies of each 
edge of T. Thus, the total length of MG is at most (i + 1.5 + e) ■ Length(A) □ 

Proposition A. 7. The mortar graph MG contains every vertex ofT. 

Bricks. A brick consists of all edges of the original graph G that are (strictly) enclosed by the boundary of 
some face / of the mortar graph. (Note that if an edge e on the outer face of G is not a part of MG, it is 
"enclosed" by the outer face of MG.) For each face / of the mortar graph that encloses at least one edge, there 
is a corresponding brick. 

Lemma A. 8 (Lemma 6.10, Borradaile et al. Illlll ). The boundary dB of a brick B, in counterclockwise order, 
is the concatenation of four paths Wb, Sb, Eb, Nb such that 

1. The set of edges of B — dB is non-empty. 

2. Every vertex ofT that is in B is on Sb or Nb- 

3. Nb is 0-short in B, and every proper subpath of Sb is e-short in B. 

4. There exists a number k' < k and vertices sq, si, • • • , Sfc' ordered west to east on Sb such that, for each 
i and each vertex x on SB[si, Sj+i), distsg {x, Si) < e ■ distBix, Nb). 
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Definition A.9 (Definition 10.3, Borradaile et al. ifTTl ). Let H be a subgraph of G such that P is a path in H. 
A joining vertex of H with P is a vertex of P that is the endpoint of an edge of H — P. 

Lemma A.IO (Tiieorem 10.7, Borradaile et al. IfTTl ). Let B be a plane graph with boundary W VJ SVJ E\J N, 
satisfying the brick properties of Lemma I A. 81 Let F be a set of edges of B. There is a forest F of B with the 
following properties: 

L If two vertices of N L) S are connected in F then they are connected in F. 

2. The number of joining vertices of F with both N and S is at most a{e), where a{e) = o{e~^'^). 

3. Length(F) < (1 + ce)LEJ^GTli{F), for some fixed constant a. 

Portals. Let 9 = 9{e) be a parameter that depends polynomially on For each brick B, we designate some 
vertices of dB as portals, evenly spaced around B as follows. Let vq G dB be the endpoint of an edge strictly 
enclosed by dB; we designate vq as a portal. Now suppose we have designated vq,vi, . . . , Vi-i as portals. Let 
Vi be the first vertex on dB such that LENGTH (55 [f i;^]) > LENGTH 76*. We designate Vi as a portal, 
unless vq G V{dB{vi^i,Vi]), in which case we stop. 

Lemma A. 11 (Lemma 7.1, Borradaile et al. [11]). For any vertex x on dB, there is a portal y such that the 
x-to-y subpath of dB has length at most LENGTll{dB) /6. 

Lemma A.12 (Lemma 7.2, Borradaile et al. ifTTI ). There are at most 9 portals on dB. 

Portal-connected graph. For any subgraph G" of the mortar graph MG, we construct a planar graph ]3~^{G") 
as follows. For each face / of G' corresponding to a brick B, we embed a copy of B inside the face /, and, 
for each portal v of B, we connect the copy of v in the brick with the copy of w on / using a zero-length edge. 
We refer to these zero-length edges as portal edges, and we refer to B^{MG) as the portal-connected graph. 
Finally, any new vertex receives penalty zero (these vertices are copies of vertices of the mortar graph). 

Theorem A. 13. Let F* be an optimal Prize-Collecting Steiner forest in G. There exists a constant 9 = 9{e) 
depending polynomially on 1 /e such that, for any choice of portals satisfying the condition in Lemma \A.ll\ the 
corresponding portal-connected graph B^{MG) contains a forest F with the following properties: 

L Length(F) < (1 + cie)LENGTH(F*) + C2e • Length(A), where ci, C2 are absolute constants 

2. Any two vertices of MG in the same component of F* are connected by F. 
Proof: Let F* be an optimal tree to the Prize-Collecting Steiner forest problem in G, and let S* be the set of 
all vertices of the mortar graph MG that are in F* . We will follow the proof of Theorem 3.2 in Borradaile 
et al. [11 J ; there are two main steps. First, we transform F* into a solution in G that only has a few joining 
vertices in each brick. To convert this into the desired forest F in the portal-connected graph B^{MG), we 
simply add edges connecting the joining vertices in each brick to the nearest portals. As there are not many 
joining vertices, we can connect them to the portals without significantly increasing the cost. We describe the 
process completely below. 

First, we add the east and west boundaries of each brick; let Fi be the union of F* with the east and west 
boundaries {Eb and Wb) for each brick B. By Lemma [a31 Length(Fi) < Length(F*) + e-LENGTH(A). 
Clearly, Fi connects all vertices of 5* connected by F*. 

Next, we reduce the number of joining vertices on the north and south boundaries of each brick. Let F^i^ 
be the subgraph of Fi that is strictly embedded in a brick B of G. We replace Fi\b with the forest F2\b that is 
guaranteed by Lemma lA. 101 We have 

LENGTH(F2|b) < (1 + Cie)LENGTH(Fi[B) 
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Let N and S denote the north and south boundaries of the brick. Since any two vertices of U that are 
connected in Fi^^ ^re also connected in F2\b^ it follows that F2\b connects all vertices of S* connected by 

Fl\B- 

We apply this procedure for each brick in order to get a subgraph F2. Since the bricks are disjoint, 
LENGTH(F2) < (1 + Cie)LENGTH(Fi) 

< (1 + cie) (Length(F*) + e ■ Length(A)) 

= (1 + cie)LENGTH(F*) + (e + cie^)LENGTH(A) 

Moreover, F2 connects all vertices of S* connected by Fi . 

Now we convert the forest F2 C G to a subgraph of B^{MG). Note that every edge of G has at least one 
corresponding edge in i3+(MG) (an edge e of MG has three copies: one mortar edge, and one inside each of 
the bricks corresponding to the two faces of G' incident to e). For each edge e of F2, we select a corresponding 
edge of B~^{MG) as follows. If e is an edge of MG, we select the corresponding mortar edge of I3'^{MG). 
Otherwise, we select the unique edge corresponding to e in B^{MG). Let F3 denote the resulting subgraph of 
B+{MG). We have: 

LENGTH(F3) = LENGTH(F2) < (1 + Cie)LENGTH(F*) + (e + Cie^)LENGTH(A). 

Since F3 does not connect the connected components of F2, we connect it using portal edges and mortar 
edges as follows. Consider a brick B, and let Vb denote the set of joining vertices of F3 with Nb U Sb- For 
each vertex v € Vb, let be the portal vertex that is closest to v, let Py be the shortest v-to-p^ path along dB, 
and let be the corresponding path of mortar edges. Let e„ be the portal edge corresponding to p^. We add 
Py, P^, and By to F3. We apply this procedure for each brick in order to get the subgraph F. First, we bound 
the length of F. 

Length (F) < Length (F3) + ^ ^ (Length (P^,) + Length (e^,) + Length (P^)) 

B vGVb 

= LENGTH(F3) + 2^ ^ LENGTH(Pu) [LENGTH(e) = 0, Length(P^) = Length(Pi,)] 

B v&Vb 

< Length(P3) + 2^ ^ LENGTH(aP)/6l(e) [LemmalAlll 

B v€Vb 

< Length(F3) + 2^a(e)LENGTH((9P)/6i(e) [Lemma|AJOl 

B 

= Length(F3) + • Length(5(P)) 

Aa(e) 

< Length (F3) + ■ Length (MG) 

IQa(e) 

< Length(F3) H ■ Length(A) [Lemma IA^ 

Setting 6{e) = 16e~'^a{e) gives us 

Length(F) < Length(F3) + e • Length(A) 

< (1 + cie)LENGTH(F*) + (2 + cie)e ■ Length(A) 
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It remains only to show that F connects any two vertices of S* connected by F2, and hence by F*. Let x 
and y be two vertices of S* connected by F2 via an x — y path P in F2. The definition of F3 breaks P into 
disjoint paths. Consider one such path Pi that is not a subpath of MG. By construction, the endpoints of Pi are 
joining vertices. When we construct F from F3, we connect the endpoints of Pi to their corresponding vertices 
on MG via portal edges. Therefore there is an x — y path in F. □ 

Spanner. For each brick B and for each subset X of the portals of B, we find an optimal Steiner Tree for B 
and X. The spanner H consists of all edges of these Steiner Trees together with the edges of the mortar graph 
MG. 

Lemma A.14. The total length of the spanner H is at most (1 + 2^+^)(| + e)LENGTH(A). 

Proof: As shown in Lemma 4.1 of HIl, the total length of all Steiner trees is at most 2^+" • LENGTH(AfG). 
Thus the length of H is at most (1 + 2^+^) • Length(MG). Lemma lAT6l completes the proof. □ 

Lemma A. 15. The spanner H contains a prize-collecting Steiner forest F' of cost at most (l+cie)LENGTH(F*)+ 
C2e ■ Length (A) + Penalty (F*), /or some absolute constants ci,C2- 

Proof: We will follow the proof of Lemma 4.2 in Borradaile et al. [11]. Let F* be an optimal forest in G and 
let F be the forest guaranteed by Theorem lA. 131 For each brick B and for each connected component K of the 
intersection of F with B, let X be the set of portals of B belonging to K; we replace K with the optimal Steiner 
Tree for B and X contained in the spanner. Let F be the subgraph resulting from all these replacements. We 
have 

Length(F) < Length(F) < (1 + cie)LENGTH(F*) + C2e ■ Length(A) 

Moreover, since F connects all vertices of MG connected by F*, and all terminals are vertices of MG as 
they are connected by T, it follows that F also connects all terminals connected by F*. Hence, Penalty (F) < 
Penalty (F*); as Cost(F) = Length (F) + Penalty (F), we obtain the lemma. □ 

Theorem[3^now follows almost directly from Lemmas lA. 14l and lATT5l as LENGTH (A) = 2 • LENGTH (T) ; 
simply construct the spanner H using a modified parameter s' = maxjci 202} ' 



B Omitted Proofs from Section 4.1 



Proof of Equation |3j Let (ai, 02) be a pair that minimizes the right hand side of Equation [3l Let F^ be an 
optimal solution for the subproblem {i£,H,a£), where i = 1,2. Let F = Fi U F2. Now we claim that F is a 
solution for the subproblem (i, H, a), i.e., it satisfies the conditions (ci) — (03). Clearly, F satisfies (ci) and 
(C2). Now let u and v be two vertices in the same part of a. Since a = ai V 02, u and v are in the same part 
of ae for some £, and thus u and v are in the same connected component of Fi. Thus the partition of V{H) 
induced by F is coarser than a. Since a is coarser than ai and 02, it follows that a is coarser than the partition 
of V{H) induced by F as well. Therefore F satisfies (03). Since E{H) C F(Fi) n F(F2), 

Length(F) < Length(Fi) +Length(F2) -Length(F) 

Since V{F) = V{Fi) U V{F2) and V{Gi) = V{Gi,) U V{Gi^), we have 

PENALTY(y(Gi) - V{F)) < Penalty (y(G'i J - V{Fi)) + Penalty (^(GiJ - V{F2)) 

Thus 

c{i,H,a) < Cost(F) < Cost(Fi)+Cost(F2)-Length(F) = c(ii, i?,ai) + c(i2,-H', 02)- Length (F) 
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Conversely, let F be an optimal solution for the subproblem (i, H, a). Let Fi be the restriction of F to Gi^, 
where £ = 1,2. Let a£ be the partition induced by the connected components of F^. Now we claim that Fi is 
a solution for the subproblem H, ag): by construction, Fe satisfies (ci) — (cs). Since -Fi and F2 intersect 
only at V{H), a = ai V 02- Therefore the right hand side of the equation is at most 

COST(Fi) + C0ST(F2) - LENGTH(iJ) < Cost(F) = c(i, H, a) 

which completes the proof. □ 



Proof of Equation |4t Suppose the minimum of the right hand side of the equation is achieved by an optimal 
solution F' for the subproblem (j, H, a). Since v is not in H, F' does not contain v. Thus F' is a solution for 
the subproblem (i, H, a), and therefore c{i, H, a) is at most the right hand side of Equation IH Therefore we 
may assume that the minimum of the right hand side is achieved by an optimal solution F'g for the subproblem 
(j, H [J {v} U E{v, S), a{v, S)). Let F = F^ - E{v, S) - {v}. Now we claim that F is a solution for the 
subproblem (i, H, a). By construction, F satisfies (ci) and (02). Therefore it suffices to show that F satisfies 

Note that we may assume without loss of generality that Fg is a forest. Now suppose that F'g has an edge 
e whose endpoints are in different parts of a. Since e is not incident to v, it follows that e is in H. But the 
partition of V{H) induced by the connected components of is a refinement of a, which is a contradiction. 

Let u and w be two vertices in the same connected component of F. It follows that the unique path of 
F'g between u and w does not pass through v, and hence u and w are in the same part of a (since otherwise 
the path between u and w has an edge with both endpoints in different parts of a). Therefore the partition of 
V{H) induced by the connected components of F is a refinement of a. Conversely, let u and w be two vertices 
contained in the same part of a. Since a(f , S) is coarser than a, u and w are in the same connected component 
of F'g. Let P be the unique path in F'g between u and w. If P — t; is a path, u and w are connected in F. 
Therefore we may assume that v is an internal vertex of P. Let u', w' be the two neighbors of v on P, where u' 
is on the subpath of P from u to v. Since there is a path between u and u' in F (namely, the subpath of P from 
u to u'), it follows from the previous argument that u and u' are in the same part of a. Similarly, w and w' are 
in the same part of a. Therefore S has two vertices in the same part of a, which is a contradiction. Thus a is a 
refinement of the partition of V{H) induced by the connected components of F. It follows that F satisfies (03) 
as well, and hence c(i, H, a) is at most the right hand side of the equation. 

Conversely, let F be an optimal solution for the subproblem {i, H, a). Since F is also a solution for the 
subproblem (j, H, a), it follows that c(i, H, a) is at least the right hand side of the equation. □ 

Proof of EquationUt Suppose that v is not in H. Let F be an optimal solution for the subproblem (z, H, a). 
Since v is not in H, F is a solution for the subproblem (j, H, a) as well, of cost 

Length(F) + Penalty - V{F)) = Length(F) + PENALTY(y(Gi) - V{F)) - Penalty(w) 

= c(i, H, a) - Penalty (f) 

Thus 

c(i, H, a) > c{j, H, a) + PENALTY (u) 
Conversely, let F be an optimal solution for the subproblem (j, H, a). Then F is a solution for (i, H, a) of cost 

LENGTH(F) + PENALTY(l/(Gi) - V{F)) = LENGTH(F) + PENALTY(y(Gj) - V{F)) + PENALTY(w) 

= c{j, H, a) + Penalty (f) 

Thus 

c{i, H, a) < c{j, H, a) + PENALTY (i;) 
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Therefore we may assume that v is in H. Let a' be a partition of V{H) — {v} satisfying the conditions above, 
and let F' be an optimal solution for the subproblem (j, H — v, a'). Let F = F' VJ E{v, S) U {v}, where 
E{v, S) is the set of all edges of H that are incident to v. Now we claim that F is a solution for the subproblem 
{i, H, a). By construction, F satisfies (ci) and (02). Therefore it suffices to verify that F satisfies (03). 

Let u and w be two vertices in the same connected component of F. Suppose that u and w are connected in 
F'. Then u and w are in the same part of a' and, since a is coarser than a', u and w are in the same part of a. 
Therefore we may assume that u and w are not connected in F'. Thus u and are in different parts of a', each 
of which contains a vertex in S. It follows that the two parts have merged into a single part of a'{v, S) = a, 
and hence u and w are in the same part of a. Conversely, let u and w be two vertices in the same part of a. If 
u and w are in the same part of a', it follows that u and w are connected in F'. Therefore we may assume that 
u and w are in different parts Pi and P2 of a', each of which contains a vertex of S. Let u' and w' be the two 
vertices of Pi n S, P2 H S. Then there exists a path in F' from u to vf, and a path from w to w'. These two 
paths together with the edges u'v, vw' form a connected subgraph of F. It follows that u and w are connected 
in F, and hence F satisfies (03). 

We have 

Length (F) = Length (F') + ^ Length (-uv) 

uv^G 

Since V{F) = V{F') U {v} and y(Gi) = U {v}, 

PENALTY(F(Gi) - F(F)) = Penalty - ^(F')) 

Thus 

c(i, iJ, a) < c(j, H -v,a')+ ^ Length(ui;) 

Conversely, let F be an optimal solution for the subproblem (i, H, a). Without loss of generality, F is a forest. 
Let F' = F — V, and let a' be the partition of V{H) — {v} induced by F'. Since F is a forest, v has at most one 
neighbor in each part of a'. Now we claim that a'{v, S) = a. Let T be any connected component of F that does 
not contain v. It follows that T is a connected component of F' as well. Since the partition of V{H) induced by 
the connected components of F is equal to a, a' contains each part of a that does not intersect S. Now consider 
the connected component T of F that contains v, and let Ti, • • • , be the connected components ofT — v. 
Since each Tj is a connected component of a', it follows that the part of a'{v, S) containing v can be obtained 
by merging the parts of a' induced by Ti, • • • ,T£ into a single part, and adding v to it. Thus a'{v, S) = a. 

Now we claim that F' is a solution for the subproblem (j, H — v, a'). By construction, F' satisfies (ci) and 
(C2). Additionally, it follows from the definition of a' that F' satisfies (C3). We have 

Length (F) = Length (F') + ^ Length (-ui;) 

uv&H 

As before. 

Penalty - V{F)) = Penalty (F(Gj) - V{F')) 

Thus 

c{i, H, a) > c{j, H -v,a') + ^ LENGTH('ut;) 

uv&H 

□ 
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